Estonian Copular and Existential Constructions as an UD Annotation Problem
نویسندگان
چکیده
This article is about annotating clauses with nonverbal predication in version 2 of Estonian UD treebank. Three possible annotation schemas are discussed, among which separating existential clauses from copular clauses would be theoretically most sound but would need too much manual labor and could possibly yield inconcistent annotation. Therefore, a solution has been adapted which separates existential clauses consisting only of subject and (copular) verb olema be from all other olema-clauses.
منابع مشابه
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
متن کاملElliptic Constructions: Spotting Patterns in UD Treebanks
The goal of this paper is to survey annotation of ellipsis in Universal Dependencies (UD) 2.0 treebanks. In the long term, knowing the types and frequencies of elliptical constructions is important for parsing experiments focused on ellipsis, which was also our original motivation. However, the current state of annotation is still far from perfect, and thus the main outcome of the present study...
متن کاملCopular Complements: Closed or Open?
In this paper, we present some preliminary observations on the syntactic analysis of copular constructions in LFG. We suggest several conclusions and directions for future research. First, the open analysis is appropriate for some copular constructions but not for others, even within the same language. Second, the same construction can be open in some languages and closed in others. Third, we c...
متن کاملArborest – a VISL-Style Treebank Derived from an Estonian Constraint Grammar Corpus
Treebank creation is a very labor-consuming task, especially if the applications intended include machine learning, gold standard parser evaluation or teaching, since only a manually checked syntactically annotated corpus can provide optimal support for these purposes. There are, however, possibilities to make the annotation process (partly) automatic, saving (manual) annotation time and/or all...
متن کاملCopular constructions and adjectival uses of bare nouns in French: a case of syntactic recategorization?
This paper deals with three copular constructions in French that take bare nominals as a predicative complement (attribut du sujet). From a lexical point of view, these constructions, which are typical of colloquial French, are very open frames. After a detailed analysis of the syntactic and semantic properties of these constructions, I will examine them in the light of a more theoretical quest...
متن کامل